Draft version January 31, 2013 

Preprint typeset using I^^'T^]X style emulateapj v. 5/2/11 



OPENING THE 21CM EOR WINDOW: MEASUREMENTS OF FOREGROUND ISOLATION WITH PAPER 

Jonathan C. Pober\ Aaron R. Parsonsj^, James E. Aguirre^, Zaki Ali\ Richard F. Bradley^"'''', Chris L. 
Carilli"^', Dave DeBoer^, Matthew Dexter", Nicole E. Gugliucci"', Daniel C. Jacobs*, Dave MacMahon'', Jason 
Manley*^, David F. Moore^, Irina I. Stefan"^ William P. Walbrugh'^ 

Draft version January 31, 2013 

ABSTRACT 

We present new observations with the Precision Array for Probing the Epoch of Reionization (PA- 
PER) with the aim of measuring the properties of foreground emission for 21cm Epoch of Reionization 
experiments at 150 MHz. We focus on the footprint of the foregrounds in cosmological Fourier space to 
understand which modes of the 21cm power spectrum wiU most hkely be compromised by foreground 
emission. These observations confirm predictions that foregrounds can be isolated to a "wedge" -Uke 
region of 2D (fcj^, fc|[)-space, creating a window for cosmological studies at higher /cy values. We also 
find that the emission extends past the nominal edge of this wedge due to inherent spectral structure 
in the foregrounds themselves, with this feature most prominent on the shortest baselines. Finally, we 
filter the data to retain only this "unsmooth" emission and image it. The resultant image shows an 
excess of power on large angular scales, but no emission can be clearly localized to any one region of 
the sky. This image is highly suggestive that the most problematic foregrounds for 21cm EoR studies 
will not be easily identifiable bright sources, but rather an aggregate of fainter emission. 
Subject headings: cosmology: observations — dark ages, reionization, first stars — techniques: inter- 
fcromctric 



1. INTRODUCTION 

The highly redshifted 21cm line of neutral hydrogen 
is widely regarded as one of the most promising probes 
of the high redshift universe, with potential to map out 
volumes extending from redshift '-^ 1 through the Epoch 
of Reionization (EoR) to the dark ag es at redshift 20 
and beyond (for revie ws of the fi eld, see Furlanetto et al. 
2006 , .Morales fc Wyithe||2010l and [Pritchard k Loeb 
20121.^ Numerous facilities and experiments targeting 



the signal from the EoR are already online or under con- 
struction, including th e LOw Frequency ARray (LOFAR; 
Yatawatta et a l. 2013 the Murchison Widefield Array 
(MWA; |Tingay et al. 20i2| pj and the Donald C. Backer 
Precision Array tor ProbTng th e Epoch of Reionization 
(PAPER; [Parsons et "al]|2010[p| All 21cm cosmology 



experiments will need to separate bright galactic and ex- 
tragalactic foregrounds from the neutral hydrogen signal, 
which can be fain ter by as much as 5 ord ers of magnitude 
or mo re (see, e.g., Furlanetto et al.|2006 and Santos et al. 
^005| . 

Considerable effort has been devoted to developing 
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sche mes to remove or isolate foregrounds from 21cm data 
(e.g. 'Morales et al.'"2006', 'Bowman et al."2009; 'Liu et al 
2009, Liu & Tcgniark 2011, Parsons ct al. 2012b, Dillon 
et al.||2012p . Almost all of these approaches rely on the 
spectral smoothness of foreground emission relative to 
the 21cm signal, which will contain significant structure 
versus frequency. The purpose of this lett er is to use the 



delay transform technique presented in (Parsons et al. 



2012b hereafter, P12b) on observations from PAPER to 



test the behavior of actual foreground emission. We wish 
to understand the footprint of foregrounds in fc-space to 
determine which modes of the 21cm power spectrum will 
be most accessible to observation. The structure of this 
letter is as follows: in |2j we describe the data used in 
these observations. In Sjsf we review the delay spectrum 
technique presented in P12b, and then describe the steps 
used in applying this approach to actual observations. 
We present our results in f|4]and conclude in ^ 

2. THE DATA 

We use 4 hours of data collected in 10 second inte- 
grations between JD 2455747.48 and 2455747.64 (4 -5 
July 2011), using a 64 element PAPER array located 
on the SKA site in the Karoo region of South Africa. 
This data set comes from t he sam e observing campaign 



described by 'StefanetaD (2012), although this specific 
4-hour window falls outside ot the observations analyzed 
therein. The dipole antennas are arranged in a "mini- 
mum re dundancy" c onfiguration optimized for imaging 
analysis (Parsons et al. 2012a), which is shown in Figure 
[l] This configuration has a maximum baseline length of 
~ 300m, corresponding to a maximum image plane reso- 
lution of 0.4° at 150 MHz. The PAPER correlator has a 
100 MHz instantaneous bandwidth from 100 - 200 MHz 
divided into 2048 frequency channels. We correlate only 
one linear polarization on each dipole, and discard all 
data from antennas 40 and 55 which were cross polar- 



2 



c 
o 

4-1 
'(/I 

o 

m 
c 
c 

0) 

-M 

c 

< 



150 
100 

50 




£ -50 



o 



-100 



-150^ 



50 -100 -50 50 100 150 

East-West Antenna Position (m) 



Figure 1. The configuration of tlie 64 PAPER dipoles used in 
tliis analysis. Tiie zero-point is tiie center of tiie array. The y- 
axis is North/South, the x-axis is East/West, and distances are in 
meters. 

ized during the time of the observation. 

An image of the field transiting during this time pe- 
riod is shown in Figure [2] This image spans a 25 MHz 
band spanning 140 - 165 MHz; 100 sub-bands of 0.25 
MHz were individually imaged and summed to make the 
map shown. No CLEANing or spectral slope correction 
was performed, and the visibilities are weighted natu- 
rally. The observation is centered on a low-foreground 
"cold patch," a potential field for an EoR science obser- 
vation. The Galactic plane, which is just setting at the 
end of the 4-hour observation, creates prominent side- 
lobes over the entire map. 

Complex antenna based gains are derived from fringe- 
fitting to Centaurus A, Pictor A, and Fornax A; an over- 
all gain scale is derived from the Helmboldt et al. ( 2008 1 
source J2214-170. The details of this calibration will be 
presented in a forthcoming paper by Parsons et al. We 
also perform minimal preparatory processing on the data 
for the power spectrum analysis. These steps include a 
small gain linearization correction to mitigate da ta quan- 
tization eff ects in the correlator (described in Parsons 



et al. 2010 ) and a corre ction for ternperat ure dependent 
gam aritts (described in Pober et al.|2012 ). We then per- 
form a radio-frequency interference (Kl'lj excision, man- 
ually flagging frequency channels of known transmitters, 
and flagging any points 6a above the mean after differ- 
encing adjacent channels along the frequency axis. 

3. ANALYSIS TECHNIQUES 

At the core of our analysis is the delay spectrum tech- 
nique presented in P12b. Since cosmological redshifting 
maps the observed 21cm line frequency into a distance 
measurement, the Fourier transform of the frequency 
axis are the line-of-sight modes of the 21cm power 
spectrum. However, the frequency dependent sampling 
pattern of an interferometric baseline also turns the fre- 
quency Fourier transform of a single baseline's visibility 
spectrum into the "delay transform" first presented in 
Parsons fc Backer (2009). The delay transform maps 
sources of celestial emission to Dirac delta functions in 



delay space corresponding to the geometric delay in ar- 
rival time between the two elements of a baseline. 

The power of the delay transform in the cosmologi- 
cal context is the framework it provides for understand- 
ing the area of fc-space affected by foreground emission. 
Since geometric delays are limited to values below the 
physical light travel time distance between the two anten- 
nas (the "horizon limit" ) , all emission from the sky maps 
to a small region in the center of delay space. However, 
any other spectral structure in the visibilities acts as a 
convolving kernel for the celestial emission. Foreground 
emission is spectrally smooth, translating into a narrow 
convolving kernel; 21cm emission has large amounts of 
spectral structure, and therefore its kernel scatters power 
from within the horizon limit to larger delays. P12b also 
showed that delay has a near one-to-one mapping to /cj| , 
meaning both that foreground emission is confined to 
large scale modes along the line of sight and that those 
delay modes free from contaminating foreground emis- 
sion are effective probes of the 21cm power spectrum. 

3.1. Delay Space CLEAN 

Some of the practical aspects of implementing the 
delay-spectrum approach in actual data were described 
in §3 of P12b. Of particular importance is the imple- 
mentation of the frequency Fourier transform using a 
Blackman-Harris window function and the ID-CLEAN 



algorithm first presented in Parsons & Backer ( 2009 ) to 
reduce the effects of RFI flags and band-edge effects. 
Even if foregrounds are spectrally smooth and easily lo- 
calized in delay/fc||-space, such sharp edges in frequency 
space will scatter power from foregrounds into otherwise 
uncontaminated regions of /c-space. 

The ID CLEAN algorithm treats RFI flags as a sam- 
pling function in frequency space, and "fills in" these 
gaps by iteratively fitting the brightest Fourier compo- 
nents in the delay domain. We force our model to be 
smooth-spectrum in frequency by only allowing delay- 
space components which fall inside a baseline-dependent 
CLEAN area (a "CLEAN box"), 50 ns beyond the phys- 
ical maximum horizon delay on that baseline. This ex- 
tra 50 ns allows CLEAN to model foreground emission 
pushed beyond the horizon limit. The exact value of 
50 ns is chosen to minimize the amount of foreground 
emission remaining in the residuals, so that the sidelobes 
of any remaining flux scattering off RFI gaps are below 
the noise. (While the foreground "model" is free of RFI 
gaps, the residuals will not be.) The analysis below does 
reveal emission extending more than 50 ns beyond the 
horizon limit, and PI 2b predicts that at the level of the 
EoR signal, foreground emission can extend as much as 
300 ns beyond the horizon. At the sensitivity level of this 
work, however, we find 50 ns to suppress the sidelobes of 
foreground emission in our "EoR" window to the noise 
level. The exact value of the cut-off chosen will need to 
be revisited in future analyses with more sensitivity. 

The end result of CLEAN is a smooth-spectrum model 
of our data which is free of RFI flagging gaps. We also 
retain a data-set consisting of the residuals between this 
model and the raw data to ensure that no information is 
lost. 

3.2. Power Spectra 
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Once the data have been CLEANed, is it quite straight- 
forward to form power spectra on a visibility-by-visibihty 
basis. Our power spectrum estim ates follow from equa- 
tion (12) of " 



Parsons et al. (2012a 



Figure 2. A dirty image of the data used in tliis analysis, centered on RA 211i52m and declination -30°43', the transiting zenith halfway 
through the observation. Prominent sidelobes from the Galaxy are seen in the right-hand side of the map. Close-ups show Cygnus A at 
19h59m and +40°44' and the point source population of a potential EoR cold-patch. The color-scale is linear in Jy, with only the brightest 
point sources saturating the range. 

note that the method used here does not take advantage 
of any coherent integration within a ww-pixel. Since fore- 
ground emission dominates the observed signal, the loss 
of sensitivity is not important, and we ignore this effect 
for computational efhciency. 

4. RESULTS 

To make power spectra of the 4-hour datasct de- 
scribed above, we first run our CLEAN algorithm over 
the full 100 MHz band. Using the whole band gives 
the best resolution in delay space, and since foregrounds 
are nearly coherent over the whole band, the additional 
information gives CLEAN the most signal-to-noise to 
build its model. For the cosmological delay transform 
to make power spectra, we use only a 25 MHz band from 
140 to 165 MHz. This smaller band still exceeds the 
~ 8 MHz band over which the z ^ 10 universe can be 



P{k) « V, 
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2fcB 



(1) 



where A is the observing frequency, is Boltzmann's 
constant, fl is the solid angle of the primary beamp^ 
B is the observing bandwidth, X and Y are cosmologi- 
cal scalars which convert observed angles and frequencies 
into /iMpc~^, and is a delay transformed visibility. We 
avoid introducting a noise-bias by forming our estimator 
V21 from adjacent time samples on a baseline. The 10- 
second interval between integrations is short enough that 
both measurements can be considered redundant samples 
of the same /c-modes. 

The isotropy of the universe allows us to then combine 
all power spectrum estimates V21 in annuli of equal k± 
to form a 2D power spectrum in the {k±, fc||)-plane. We 



Deriv ations of equat i on [l] in [Morales] l|2005| , [McQuinn et al.] 
([2OO6), and Parsons et ah] l |20'r2a| , relate ilB to an ettective cosmo- 
logical volume, using a top-hat primary beam or effective area as 
a pedagogical tool to simplify the result. More generally, however, 
the effective O in equation 111 is actually J A^{9,ij)) <i9 d<j>, where 
A{9, 4>) is the power response of the primary beam. A derivation of 
this effect will be presented in a subsequent full length publication 
by Parsons et al. For PAPER, this beam is a factor of ~ 2 smaller 
than ^ A{e,<t>) d6» Atj,. 



treated as coeval (Furlanetto et al. 2006). Since the main 
purpose of this work is to understand the fc-space behav- 
ior of foregrounds, we ignore this effect, as the additional 
bandwidth gives us better A:-space resolution. For an 
EoR analysis, the isolation of foregrounds will be slightly 
worse than shown here, owing to the necessarily poorer 
resolution. 

Forming individual power spectra from each baseline 
of the array and binning in annuli of constant! fcj^ | yields 
the 2-dimensional P{k±, k^^) shown in Figure p| The bin- 
ning in jfcj^l is very minimal; most bins contam only one 
baseline and gaps are still visible (especially at the small 
/c_L-values) where there are no baselines of that length. 



P{k) [mK^ih-^Mpif 




We do no binning in ; this resolution is set by the 
25 MHz bandwidth used in the analysis}^ The most 
prominent feature is the "wedge" -hke shape of the fore- 
ground emissis on as predicted in a number of s tudies , 
including Datta et all (|2010|, jVedantham et"al] (|2012[) 



0.06 0.08 

Figure 3. A two-dimensional power spectrum of the 4 hours of data analyzed. The wedge-like nature of the foreground emission is 
extremely prominent. The white line marks the horizon limit and the orange line is 50 ns beyond. Binning is minimal; each k± values 
corresponds to at most a few baselines, with gaps caused by a lack of a baseline with that magnitude. The "edge brightening" of the 
foreground emission on the longest baselines as one moves near the horizon limit can be attributed to the Galactic center, and moves as 
expected when the data are viewed as a function of time. 

zon limit moves to higher A:|| values for longer baselines. 
Therefore, it is easier for a fixed width kernel of unsmooth 
emission to corrupt supra-horizon delays on shorter base- 
lines than on longer. 

To highlight the steepness of the foreground roll-off, 
we plot a 1-dimensional fc|| power spectrum for bins of 
several baseline lengths in Figure |4] We see that the 
foreground emission can fall by as much as three to four 
orders of magnitude in a factor of 2 change in fcy . It is 
difficult to explicitly compare this result to the predic- 
tions of P12b, due to the different resolutions and bin- 
ning used. The placement of bin edges can significantly 
complicate comparison when the fall-off is so steep, as a 
slight shift in the bin can result in a large change in the 
average value within. For similar reasons, it is difhcult 
to say exactly where the foreground fall-off falls below 
the noise. Given these caveats, there is nothing in this 
data to contradict the predictions of P12b. It is clear, 
however, that sensitivities will have to increase several 
more orders of magnitude before anything can be said 
about the behavior of foreground emission at the tens of 
mK^ level where the expected EoR signal lies. 

To further investigate the nature of the supra-horizon 
emission, we filter the data in delay-space, selecting only 
the delay bin just beyond the 50 ns cutoff. That is to say, 
we image the emissio n from the bin lying just above the 



Morales etar[ (2012), Trott et al. (2012), and fl2U~As 
argued in Jr'12b, this "wedge" footprint in fc-space is not 
a result of imperfect calibration in foreground removal 
(since we attempt no foreground removal in this work), 
but a property of the emission itself as measured by an 
interferometer . 

Also of note are the two diagonal lines plotted on top 
of the power spectrum. The white line corresponds to 
the horizon-limit in fcy for a baseline of corresponding 
length k±; the orange line is 50 ns beyond the hori- 
zon, corresponding to the CLEAN region used in our 
deconvolution. As predicted, emission extends beyond 
the horizon limit due the intrinsic spectral structure of 
the foreground emission. 

We draw attention to the fact that the supra-horizon 
emission does not have a constant width in ku as a func- 
tion of k±. Rather, more emission extends beyond the 
horizon on the smallest /cj^-values (i.e. the shortest base- 
lines). The likeliest explanation for this result is a com- 
bination of two different effects. First, the shortest base- 
lines will "see" more total emission from the sky, since 
they resolve out less of the diffuse Galactic synchrotron. 
Therefore, the sidelobes of this emission will be brighter 
as well, so that we can see their extent further in fcy be- 
fore they fall below the noise level. The second effect is 
somewhat more subtle. A given amount of spectral "un- 
smoothness" corresponds to a kernel with a fixed width 
in delay (and therefore, /cy) space. However, the hori- 

Because of the Blackman-Harris window function used in 
CLEAN, only every other sample plotted in fcy is statistically in- 
dependent. 



orange line in Figure |3[^"| The resultant map is shown 
in Figure [5] None of the bright point sources in Figure 

t remain visible, nor does any emission associated with 
e Galactic plane. Closer inspection reveals low spatial- 



^® While we might ideally want an image of all the emission 
above the horizon-limit, a full data-cube versus frequency would 
be required to observe all the different fcy modes. Given the short 
observation time, this cube is too noisy; instead, we image only 
one mode per-baseline, which allows us to take advantage of the 
full band's sensitivity. 
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Figure 4. A 1-dimensional power spectrum versus fcy for bins of several baseline lengttis. Labels correspond to the maximum lengtli 
baseline in the bin, e.g., the 16- wavelength bin contains baselines from 8 to 16 wavelengths. To preserve the steep roll-off of the foreground 
emission, the data are plotted with their natural resolution at lower fcn values, and logarithmicallv binned at hisher values. 
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Figure 5. A dirty image of the data, after selecting only th delay bin just past 50 ns beyond the horizon limit (i.e. one bin beyond the 
orange line in Figure [3|. The color-scale is linear in Jy. The dominant feature in the map is an excess of emission above the noise on large 
spatial scales. No emission can be identified with the bright sources in Figure [2] nor are the large scale features obviously correlated with 
Galactic structure. 



frequency ripples across the maprj 

We can confirm the existence of this excess power by 
comparing the rms of the image to an estimate of the 
expected noise level in the map. Using parameters for a 
4 hour observation at 150 MHz with a 25 MHz bandwidth 

The use of natural weighting as in Figurc|2|will lead to a scale 
dependent noise across the image; to correct for this effect. Figure 
[5]was made using uniform weighting of the visibilities, and the low 
spatial-frequency power is still visible. 



and 62 antennas gives: 



Act 



A2 



y/N{N-l)Bt 



14 mjy, 



(2) 



where is Boltzmann's constant, O is the solid angle of 
the primary beam, A is the observing wavelength, Tgys is 
the system temperature, N is the number of antenna in 
the array, B is the bandwidth, and t is the observing time 
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([Thoinpson et al. [2007| ); fiducial values for PAPER are 
il = U.75 sr and Tsys = 1000 K. The exact value of Tgys is 
likely too high by as much as a factor of ~ 2, but in this 
case, using a high value for Tgys makes for a conservative 
estimate. A more thorough discussion of Tgys for PAPER 
will be presented in a forthcoming publication by Parsons 
et al. The measured rms from ^ 15,000 independent 
measurements near the center of Figurejs] (where primary 
beam effects can be ignored) is 21 mJy, which supports 
our hypothesis that the structure in Figure [5] is real. 

Given the k± dependence of the emission above the 
orange line in Figure [31 it is not surprising that ex- 
cess power is seen on Targe angular scales. We have 
already argued above why we might expect such a re- 
sult in the power spectrum domain. What is surprising 
is that none of this emission can be easily associated 
with the brightest structures visible in the image do- 
main. It appears that the bulk of the emission contami- 
nating the EoR window comes not from easily localizable 
sources or the Galactic plane itself, but from an aggre- 
gate of fainter emission. However, we caution against 
over-interpretation of Figure [5j some or all of this supra- 
horizon emission could be created by frequency structure 
in the PAPER instrument itself. The frequency structure 
in both simulated and calibrated models of the P APER 
system ( [Parsons et al. 2010[ Pober et al. 2012) would 
be a negligible factor iri pushing smooth-spectrum emis- 
sion beyond the horizon (P12b), but we cannot rule out 
the possibility that the instrument has additional yet- 
unmeasured frequency structure. 



5. CONCLUSIONS 

We have presented new observations from PAPER 
measuring the properties of foreground emission in cos- 
mological Fourier space. These observations have con- 
firmed general predict ions presented in, e.g., |Datta et al.| 
( [20 10[ ), [Morales et al.[ ^12) and P12b: that foreground 
emission occupies a wedge" in the 2D (fc_L,fc||) plane, 
leaving a window at higher fcy values for 21cm EoR stud- 
ies. We have also confirmed that shorter baselines yield 
a larger window onto the cosmological signal. However, 
this the window does not grow perfectly linearly with 
decreasing baseline length. A combination of the in- 
creased brightening of diffuse Galactic synchrotron on 
the shortest baselines and the size of a given delay-space 
kernel relative to sky-delays both lead to excess encroach- 
ment past the nominal "horizon-limit" . Therefore, while 
shorter baselines do make the best probes of the EoR 
signal, there will be diminishing returns at the shortest 
baselines. 



We have also presented an image of the supra-horizon 
"unsmooth" emission extending past the nominal edge 
of the wedge. This image confirms the existence of ex- 
cess power on the shortest baselines in the array, but is 
unable to localize any of the emission to known sources 
on the sky. This result is highly suggestive that a tar- 
geted removal of the brightest sources of emission may 
have only a minor effect on increasing the size of the EoR 
window. 
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